FAQs

For training models we support the following types of datasets:

chat: Like RyokoAI/ShareGPT52K or WizardLM/WizardLM_evol_instruct_V2_196k

// Type: `chat`
[
	{"conversations": [{"from": "system", "value": "..."}]},
	{"conversations": [{"from": "human", "value": "..."}]},
	{"conversations": [{"from": "gpt", "value": "..."}]},
	{"conversations": [{"from": "human", "value": "..."}]},
	{"conversations": [{"from": "gpt", "value": "..."}]},
	...
]
// "from" can only be one of "system", "human", "gpt"
// There can only be 1 "system" in starting of the conversation
// After that, "human" and "gpt" alternate every message starting with human

instruct: Like vicgalle/alpaca-gpt4 or tatsu-lab/alpaca

// Type: ``instruct``
[
	{"instruction": "...", "input": "...", "output": "..."},
	{"instruction": "...", "input": "...", "output": "..."},
	{"instruction": "...", "input": "...", "output": "..."},
	...
]
// Instruction is the task LLM needs to do
// Input is the input to the task
// Output is the expected generation

completion: Like Oasst or Wikitext

// Type: `completion`
[
	{"text": "..."},
	{"text": "..."},
	...
]

For more help in dataset formats please check out this page.

To use Studio through LangChain, you can follow these steps

Install langchain openai

pip install langchain-openai

Export studio key as OpenAI key

export OPENAI_API_KEY="<Tune Studio API Key>"

Init ChatOpenAI()

from langchain_openai import ChatOpenAI

llm = ChatOpenAI(
    base_url="https://proxy.tune.app/",
    model="rohan/tune-gpt4",
)

llm.invoke("how can langsmith help with testing")

Profit 💯 Try building a translator app using LangChain + Tune + Streamlit Follow the LangChain tutorial here

To use Studio through LLama Index, you can follow these steps

Install llama-index

pip install llama-index
pip install llama-index-llms-openrouter

Export studio key as OpenAI key

export OPENROUTER_API_KEY="<Tune Studio API Key>"

Init llm using OpenRouter

from llama_index.llms.openrouter import OpenRouter

llm = OpenRouter(
    api_base="https://proxy.tune.app",
    max_tokens=256,
    model="rohan/mixtral-8x7b-inst-v0-1-32k",
)

response = llm.complete("Paul Graham is ")
print(response)

Profit 💯 Follow along with the LlamaIndex tutorial here

Getting Started

Concepts

Miscelaneous

Organization

Datasets

Models

Finetune Jobs

Integrations

Additional

Getting Started

Concepts

Miscelaneous

​Organization

​Datasets

​Models

​Finetune Jobs

​Integrations